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Please cancel claims 1, 5 and 9. 
Please amend the claims as follows. 

1. (Cancelled) 

2. (Currently Amended) The apparatus of claim 1 further comprising An apparatus 
comprising: 

at least one processor; 

a memory coupled to the at least one processor; 

a database table residing in the memory; 

a cardinality estimator residing in the memory and executed by the at least one 

processor, the cardinality estimator estimating cardinality of an intermediate dataset that 
satisfies a query to the database tabic in a manner that accounts for data skew in the 
database table; 

a query optimizer residing in the memory that uses the estimated cardinality from 

the cardinality estimator to optimize the query; and 

a frequent values list residing in the memory that contains a list of values in the 
database table, each value having a corresponding frequency, wherein the cardinality 
estimator estimates the cardinality of the intermediate dataset by determining whether a 
frequency corresponding to a value exceeds a predetermined threshold, and if the 
frequency exceeds the predetermined threshold, accounting for the corresponding value, 
and if the frequency does not exceed the predetermined threshold, using a formula to 
estimate the cardinality of the intermediate dataset, the formula accounting for data skew 
in the database table by subtracting the frequency of all values above the predetermined 
threshold in the frequent values list that satisfy the query from the total number of rows in 
the database table intermediate dataset . 
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1 3. (Original) The apparatus of claim 2 wherein the cardinality estimator estimates the 

2 cardinality Ca' of the intermediate dataset using the formula: 



3 Ca'=P + M(l-(l~) T ) 

M 

4 where 

5 M = Ca - (P+Q) 

6 P = number of distinct values in the frequent values list above the 

7 predetermined threshold that satisfy the query; 

8 Q = number of distinct values in the frequent values list above the 

9 predetermined threshold that do not satisfy the query; 

10 Ca = cardinality of the database table; 

11 Y = X-Fi; 

12 X = number of rows in the intermediate dataset; and 

13 Fi = sum of frequencies of values in the frequent values list above the 



14 predetermined threshold that satisfy the query. 
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1 4. (Previously Presented) An apparatus comprising: 

2 at least one processor; 

3 a memory coupled to the at least one processor; 

4 a database table residing in the memory; 

5 a frequent values list residing in the memory that contains a list of values in the 

6 database table, each value having a corresponding frequency; and 

7 a cardinality estimator residing in the memory and executed by the at least one 

8 processor, the cardinality estimator estimating cardinality of an intermediate dataset for a 

9 query to the database table using the following formula: 

10 Ca'= P + M(l-(\ — -) Y ) 

M 

1 1 where 

12 M = Ca - (P+Q) 

13 P = number of distinct values in the frequent values list above a 

14 predetermined threshold that satisfy the query; 

15 Q = number of distinct values in the frequent values list above the 

1 6 predetermined threshold that do not satisfy the query; 

17 Ca = cardinality of the database table; 

18 Y = X-Fi; 

19 X = number of rows in the intermediate dataset; and 

20 Fi = sum of frequencies of values in the frequent values list above the 

2 1 predetermined threshold that satisfy the query; and 

22 a query optimizer residing in the memory that uses the estimated cardinality of the 

23 intermediate dataset to optimize the query. 



4 



1 5. (Cancelled) 

1 6. (Currently Amended) The method of claim 5 wherein step (D) includes the steps of : 

2 A method for optimizing a query to a database table by estimating cardinality of 

3 an intermediate dataset that results from processing the query, the method comprising the 

4 steps of: 

5 (A) evaluating the query: 

6 (B) estimating cardinality of the intermediate dataset using a formula that 

7 accounts for data skew in the database table by performing the steps of: 

8 selecting a value in a frequent values list that contains a list of values in 

9 the database table, each value having a corresponding frequency; 

10 if the selected value has a corresponding frequency that exceeds a 

1 1 predetermined threshold, incrementing the cardinality estimate by one; and 

12 if the frequency docs not exceed the predetermined threshold, using a 

13 formula to estimate the cardinality of the intermediate dataset, the formula 

14 accounting for data skew in the database table by subtracting the frequency of all 

15 values above the predetermined threshold in the frequent value table that satisfy 

1 6 the query from the total number of rows in the database table intermediate dataset; 

17 and 

18 (C) using the cardinality estimate in step (B) to optimize the query . 
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1 7. (Previously Presented) The method of claim 6 wherein the cardinality estimator 

2 estimates the step of estimating the cardinality Ca' of the intermediate dataset in step (B) 

3 uses the formula: 



4 Ca'=P + M(l-(l--V) 

M 

5 where 

6 M = Ca - (P+Q) 

7 P = number of distinct values in the frequent values list above the 

8 predetermined threshold that satisfy the query; 

9 Q = number of distinct values in the frequent values list above the 

10 predetermined threshold that do not satisfy the query; 

11 Ca = cardinality of the database table; 

12 Y = X-Fi; 

13 X = number of rows in the intermediate dataset; and 

14 Fi = sum of frequencies of values in the frequent values list above the 



1 5 predetermined threshold that satisfy the query. 
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1 8. (Previously Presented) A method for optimizing a query to a database table by 

2 estimating cardinality of an intermediate dataset that results from processing the query on 

3 a database table, the method comprising the steps of: 

4 (A) evaluating the query; 

5 (B) estimating the cardinality Ca' of the intermediate dataset using the formula: 

6 Ca'=P+M(l-(l — -) Y ) 

M 

7 where 

8 M = Ca - (P+Q) 

9 P = number of distinct values in the frequent values list above a 

10 predetermined threshold that satisfy the query; 

11 Q = number of distinct values in the frequent values list above the 

12 predetermined threshold that do not satisfy the query; 

13 Ca = cardinality of the database table; 

14 Y = X-Fi; 

15 X = number of rows in the intermediate dataset; and 

16 Fi = sum of frequencies of values in the frequent values list above the 

17 predetermined threshold that satisfy the query; and 

1 8 (C) using the cardinality estimate in step (B) to optimize the query. 
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1 9-11 (Cancelled) 

1 12. (Currently Amended) The program product of claim 9 A computer-readable program 

2 product comprising: 

3 (A) cardinality estimator estimating cardinality of an intermediate dataset that 

4 satisfies a query to a database table in a manner that accounts for data skew in the 

5 database table, wherein the cardinality estimator evaluates a frequent values list that 

6 contains a list of values in the database table, each value having a corresponding 

7 frequency, wherein the cardinality estimator estimates the cardinality of the intermediate 

8 dataset by determining whether a frequency corresponding to a value exceeds a 

9 predetermined threshold, and if the frequency exceeds the predetermined threshold, 

10 accounting for the corresponding value, and if the frequency does not exceed the 

1 1 predetermined threshold, using a formula to estimate the cardinality of the intermediate 

12 dataset, the formula accounting for data skew in the database table by subtracting the 

13 frequency of all values above the predetermined threshold in the frequent values list that 

14 satisfy the query from the total number of rows in the database tabic intermediate dataset ; 

15 (B) a query optimizer that uses the estimated cardinality from the cardinality 

16 estimator to optimize the query; and 

17 (O recordable media bearing the cardinality estimator and the query optimizer . 
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1 13. (Previously Presented) The program product of claim 12 wherein the cardinality 

2 estimator estimates the cardinality Ca' of the intermediate dataset using the formula: 



3 Ca'=P+M(l-(l--V) 

M 

4 where 

5 M = Ca - (P+Q) 

6 P = number of distinct values in the frequent values list above the 

7 predetermined threshold that satisfy the query; 

8 Q = number of distinct values in the frequent values list above the 

9 predetermined threshold that do not satisfy the query; 

10 Ca = cardinality of the database table; 

11 Y = X-Fi; 

12 X = number of rows in the intermediate dataset; and 

13 Fi = sum of frequencies of values in the frequent values list above the 



14 predetermined threshold that satisfy the query. 
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1 14. (Currently Amended) A computer-readable program product comprising: 

2 (A) a cardinality estimator that estimates cardinality of an intermediate dataset for 

3 a query to a database table using the following formula: 

4 Ca . = P +M(1 _ ( l_J_)^ 

M 

5 where 

6 M = Ca - (P+Q) 

7 P = number of distinct values in a frequent values list above a 

8 predetermined threshold that satisfy the query; 

9 Q = number of distinct values in the frequent values list above the 

10 predetermined threshold that do not satisfy the query; 

11 Ca = cardinality of the database table; 

12 Y = X-Fi; 

13 X = number of rows in the intermediate dataset; and 

14 Fi = sum of frequencies of values in the frequent values list above the 

1 5 predetermined threshold that satisfy the query; 

16 (B) a query optimizer that uses the estimated cardinality from the cardinality 

17 estimator to optimize the query; and 

1 8 (C) recordable media bearing the cardinality estimator and the query optimizer. 
1 15. (Cancelled) 

1 16. (Cancelled) 



10 



STATUS OF THE CLAIMS 



Claims 1-16 were originally filed in this patent application. In response to a first 
office action dated 05/19/2006, an amendment was filed on 08/18/2006 that cancelled 
claims 10-11 and 15-16, and amended claims 1-2, 4-9 and 12-14. In the pending final 
office action, claims 2, 3, 6, 7, 12 and 13 were rejected under 35 U.S.C. §1 12, second 
paragraph. Claims 9 and 12-14 were rejected under 35 U.S.C. §101. Claims 1, 5 and 9 
were rejected under 35 U.S.C. § 103(a) as being unpatentable over U.S. Patent 
Application Publication 2004/0059743 to Burger in view of Applicant's Admitted Prior 
Art (AAPA). Claims 4 and 8 were allowed. In this amendment, claims 1, 5 and 9 have 
been cancelled, and claims 2, 6, 12 and 14 have been amended. Claims 2-4, 6-8 and 12- 
14 are currently pending. 
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